Identifying clusters of functionally related genes in genomes

Authors

  • Gangman Yi
  • Sing-Hoi Sze
  • Michael R. Thon
Abstract

MOTIVATION: An increasing body of literature shows that genomes of eukaryotes can contain clusters of functionally related genes. Most approaches to identify gene clusters utilize microarray data or metabolic pathway databases to find groups of genes on chromosomes that are linked by common attributes. A generalized method that can find gene clusters regardless of the mechanism of origin would provide researchers with an unbiased method for finding clusters and studying the evolutionary forces that give rise to them.

RESULTS: We present an algorithm to identify gene clusters in eukaryotic genomes that utilizes functional categories defined in graph-based vocabularies such as the Gene Ontology (GO). Clusters identified in this manner need only have a common function and are not constrained by gene expression or other properties. We tested the algorithm by analyzing genomes of a representative set of species. We identified species-specific variation in percentage of clustered genes as well as in properties of gene clusters including size distribution and functional annotation. These properties may be diagnostic of the evolutionary forces that lead to the formation of gene clusters.

AVAILABILITY: A software implementation of the algorithm and example output files are available at http://fcg.tamu.edu/C_Hunter/.
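The abstract does not spell out the algorithm, so the following Python sketch is only an illustration of the general idea of finding chromosomally adjacent genes that share a functional category in a graph-based vocabulary. It is not the authors' method. The function names (`with_ancestors`, `find_functional_clusters`), the parameters `min_size` and `max_gap`, and the toy term names and gene identifiers are all assumptions made for this example.

```python
from collections import defaultdict


def with_ancestors(terms, parents):
    """Return the given terms plus all of their ancestors in the term graph.

    `parents` maps a term to its parent terms (a hypothetical, toy stand-in
    for a graph-based vocabulary such as GO).
    """
    seen = set()
    stack = list(terms)
    while stack:
        term = stack.pop()
        if term not in seen:
            seen.add(term)
            stack.extend(parents.get(term, ()))
    return seen


def find_functional_clusters(genes, min_size=3, max_gap=1):
    """Find runs of genes on one chromosome that share a term.

    `genes` is a list of (gene_id, set_of_terms) in chromosomal order.
    Two genes sharing a term join the same run when at most `max_gap`
    genes lacking that term lie between them. Runs with at least
    `min_size` genes are reported as (term, [gene_ids]).
    """
    positions = defaultdict(list)
    for index, (_, terms) in enumerate(genes):
        for term in terms:
            positions[term].append(index)

    clusters = []
    for term in sorted(positions):
        indices = positions[term]
        run = [indices[0]]
        for index in indices[1:]:
            if index - run[-1] - 1 <= max_gap:
                run.append(index)
            else:
                if len(run) >= min_size:
                    clusters.append((term, [genes[j][0] for j in run]))
                run = [index]
        if len(run) >= min_size:
            clusters.append((term, [genes[j][0] for j in run]))
    return clusters


# Toy data: all term names and gene identifiers are invented.
parents = {"sugar_transport": ["transport"], "ion_transport": ["transport"]}
chromosome = [
    ("g1", {"sugar_transport"}),
    ("g2", {"ion_transport"}),
    ("g3", {"kinase"}),
    ("g4", {"sugar_transport"}),
    ("g5", {"kinase"}),
    ("g6", {"kinase"}),
    ("g7", {"kinase"}),
    ("g8", {"ion_transport"}),
]
annotated = [(gene, with_ancestors(terms, parents)) for gene, terms in chromosome]
clusters = find_functional_clusters(annotated, min_size=3, max_gap=1)
```

In this toy example, propagating annotations to ancestor terms is what lets `g1`, `g2` and `g4` group under the broader `transport` category even though no single specific term covers all three. A real analysis would also need a statistical test against chance co-location, which this sketch omits.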


Similar resources

Detecting gene clusters under evolutionary constraint in a large number of genomes

MOTIVATION Spatial clusters of genes conserved across multiple genomes provide important clues to gene functions and evolution of genome organization. Existing methods of identifying these clusters often made restrictive assumptions, such as exact conservation of gene order, and relied on heuristic algorithms. RESULTS We developed a very efficient algorithm based on a 'gene teams' model that ...


The Clusters of Transmembrane Protein Genes in Prokaryotic Genomes

It is known that genes coding for functionally related proteins tend to be located close to each other in genomes. This property helps in studying genes of unknown function. Transmembrane proteins (TMPs), which account for 20-30% of proteomes, reside in biological membranes and have essential roles in the transmission of materials and information through the membrane. We are interested in how ...


Transmembrane-Protein-Gene Clusters in Prokaryotic Genomes

With the wealth of complete genome sequences, a large number of genes have been identified. It has been reported that genes coding for transmembrane proteins (TMPs) tend to be located in dense clusters on the genome [4], and some closely located TMP genes seem to be simultaneously expressed or functionally related. In this study, we analyzed 46 complete prokaryotic genomes by expressing them as gene sequences ...


An Improved Model for Gene Cluster Inference

Inferring functionally related genes in microbial genomes is an important problem, which has been addressed by various gene cluster detection methods. Existing models capture genomic inversions, gene duplications and insertions across a pair of genomic regions well. Clusters involving multiple regions are indirectly inferred through constituent pairs. In this paper, we improve upon current work...


Automatic screening for groups of orthologous genes in comparative genomics using multiple-component clustering

Understanding evolutionary relationships among genes from different organisms is a problem of modeling evolutionary history while solving practical problems related to the functional annotation of genes. We have developed an automatic method for discovering groups of gene sequences, present in different organisms, that are functionally related through evolution. We have developed a new clustering method...



Journal:
  • Bioinformatics

Volume 23, Issue 9

Pages: -

Publication date: 2007